Matching Numeric Strings under Noise

Authors

  • Veli Mäkinen
  • Gonzalo Navarro
  • Esko Ukkonen
Abstract

A numeric string is a sequence of symbols from an alphabet $U$, where $U$ is some numerical universe closed under addition and subtraction. Given two numeric strings $A = a_1 \cdots a_m$ and $B = b_1 \cdots b_n$ and a distance function $d(A, B)$ that gives the score of the best (partial) matching of $A$ and $B$, the transposition invariant distance is $\min_{t \in U} \{ d(A + t, B) \}$, where $A + t = (a_1 + t)(a_2 + t) \cdots (a_m + t)$. The corresponding matching problem is to find occurrences $j$ of $A$ in $B$ where $d(A + t, B_{j' \ldots j})$ is smaller than some given threshold and $B_{j' \ldots j}$ is a substring of $B$. In this paper, we give efficient algorithms for matching numeric strings with and without transposition invariance under noise; we consider distance functions $d(A, B)$ such that symbols $a \in A$ and $b \in B$ can be matched if $|b - a| \le \delta$, or the largest differences $|b - a|$ can be discarded.
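
To make the definitions concrete, the following is a minimal brute-force sketch of one special case: transposition-invariant matching where every pattern symbol must align with a text symbol within tolerance δ, with no gaps. It is not one of the paper's algorithms, and it assumes integer symbols and an integer δ. The function name and the 0-indexed start positions are choices made for this illustration only (the abstract indexes occurrences by their end position j). The sketch relies on the observation that a single transposition t fits a whole window exactly when the largest and smallest differences b − a in that window are at most 2δ apart.

```python
def transposition_delta_matches(A, B, delta):
    """Naive O(m*n) scan (illustrative only, not the paper's method).

    Returns a list of (start, t) pairs such that
    |B[start + i] - (A[i] + t)| <= delta for every i.
    Assumes integer symbols and an integer delta >= 0.
    """
    m, n = len(A), len(B)
    hits = []
    if m == 0 or m > n:
        return hits
    for start in range(n - m + 1):
        # Differences between aligned text and pattern symbols.
        diffs = [B[start + i] - A[i] for i in range(m)]
        lo, hi = min(diffs), max(diffs)
        # Some t is within delta of every difference
        # exactly when the spread is at most 2 * delta.
        if hi - lo <= 2 * delta:
            # Any t in [hi - delta, lo + delta] works; report the smallest.
            hits.append((start, hi - delta))
    return hits


if __name__ == "__main__":
    pattern = [1, 3, 2]
    text = [10, 11, 13, 12, 5]
    print(transposition_delta_matches(pattern, text, 0))
    print(transposition_delta_matches(pattern, text, 1))
```

With δ = 0 this reduces to exact transposition-invariant matching. Larger δ admits noisy occurrences, and each hit reports one valid transposition, not all of them.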

Similar resources

Order-preserving matching

We introduce a new string matching problem called order-preserving matching on numeric strings where a pattern matches a text if the text contains a substring whose relative orders coincide with those of the pattern. Order-preserving matching is applicable to many scenarios such as stock price analysis and musical melody matching in which the order relations should be matched instead of the str...

On Representations of Ternary Order Relations in Numeric Strings

Order-preserving matching is a string matching problem of two numeric strings where the relative orders of consecutive substrings are matched instead of the characters themselves. The order relation between two characters is a ternary relation (>,<,=) rather than a binary relation (>,<), but it was not sufficiently studied in previous works [5, 7, 1]. In this paper, we extend the representations ...

A system for recognizing numeric strings from topographical maps

This paper proposes a system for recognizing numeric strings from topographical maps, which is composed of the automatic recognition stage and the interactive recognition stage. In this method, uncertain numeric strings extracted through the automatic recognition stage based on topographical map feature only, are confirmed and corrected by the interactive recognition stage. Therefore we can obt...

Meter Numeric Character Recognition Using Illumination Normalization and Hybrid Classifier

In this paper, we propose an improved numeric character recognition method which can recognize numeric characters well in low-illuminated and shade-illuminated environments. The LN (Local Normalization) preprocessing method is used in order to enhance low-illuminated and shade-illuminated image quality. The reading area is detected using line segment information extracted from the illumination...

Performance Evaluation of Local Detectors in the Presence of Noise for Multi-Sensor Remote Sensing Image Matching

Automatic, efficient, accurate, and stable image matching is one of the most critical issues in remote sensing, photogrammetry, and machine vision. In recent decades, various algorithms have been proposed based on the feature-based framework, which concentrates on detecting and describing local features. Understanding the characteristics of different matching algorithms in various applications ...

Publication date: 2003